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Description 

BACKGROUND OF THE INVENTION 

5 The present invention relates to a novel kaurene synthase and a DNA which encodes the enzyme. 

ent-Kaurene is an important intermediate in gibberellin (GA) biosynthesis, and is synthesized from geranylgeranyl 
pyrophosphate (GGPP) via copaJyl pyrophosphate (CPP). These steps are catalyzed by ent-kaurene synthase A (KSA) 
and ent-kaurene synthase B (KSB), respectively (Coolbaugh, R.C., The Biochemistry and Physiology of Gibberellins, 
Vol.1, Pracger Publishers, New York, pp.53-98, 1983). Other terpenoids such as carotenoids, xanthophyll, and phytol 

10 are also synthesized from geranylgeranyl pyrophosphate as a precursor, while copalyl pyrophosphate is used as pre- 
cursors for macrocyclic diterpene biosyntheses. ent-kaurene synthase A and ent-kaurene synthase B are thus impor- 
tant enzymes in the Initial stage of gibberellin biosynthesis (Coolbaugh, supra; Chung C.H. et al.. Plant PhysroK. 80, 
pp.544-548, 1986; Graebe, J.E., Annu. Rev. Plant. Physiol., 38. pp.419-465. 1987). 

Duncan et al. separated ent-kaurene synthase A and ent-kaurene synthase B from native cucumber (Marah mac- 

15 rocarpus L.) using a QAE column. The authors suggested that the conversion of geranylgeranyl pyrophosphate to ent- 
kaurene is catalyzed by the two distinguishable enzymes (Duncan, J.D. and West C.A., Plant Physiol., 68. pp.1 128- 
1 134, 1 981). In this article, it is shown that ent-kaurene synthase A and ent-kaurene synthase B closely associated with 
each other during the ent-kaurene synthesis and that ent-kaurene synthase B preferentially utilized endogenous 
copalyl pyrophosphate produced by ent-kaurene synthase A rather than exogenous copalyl pyrophosphate. 

20 As shown in the scheme set out below, ent-kaurene synthase A catalyzes the conversion of geranylgeranyl pyro- 
phosphate (GGPP) to copalyl pyrophosphate (CPP), and the resulting copalyl pyrophosphate is successively converted 
to ent-kaurene by ent-kaurene synthase B. 



KSA activity KSB activity 
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In recent years, cDNA clonings of gibberellin biosynthases have been reported. GAI locus of Arabidopsis was iso- 
lated by genomic subtraction (Sun. TP, et al.. Plant Cell, 4. pp.1 19-128. 1992) and the conesponding cDNA done was 

40 verified to encode ent-kaurene synthase A by over-expression system in E. coli (Sun TP and Kamiya, Y, Plant Cell, 6, 
pp. 1509-1 518. 1994). AnI locus Of maize was cloned by the transposon tagging technique and the deduced amino acid 
sequence from the cDNA was revealed to have significant homology with Arabidopsis ent-kaurene synthase A, 
although the function of the expressed protein remains unknown (Bensen, R.J., et al.. Plant Cell, 7, pp.75-84, 1995). A 
putative 3b-hydroxylase in gibberellin biosynthesis was also cloned by the T-DNA tagging method using Arabidopsis 

45 QA4 mutant (Chiang, H-h., et al.. Plant Cell. 7. pp.195-201. 1995). 

As for ent-kaurene synthase B, the enzyme was partially purified from Ricinus communis (Spickett, CM., et al., 
Phytochem., 37. pp.971 -973. 1994). Ricinus communis biosynthesizes kaurene and its family diterpenes. i.e.. 
bayarene. torakiroban. and sandarakopymaradiene. These compounds are synthesized through selective cyclizations 
from a common intermediate, copalyl pyrophosphate. Although a cloning of an ent-kaurene synthase B gene was 

so attempted (15th International Conference on Plant Growth Sut)stance. July 14-18, 1995, Minneapolis, USA, Kamiya, Y. 
et at.. Subject No. 040 and Saito. T, et al.. Subject No. 110), the complete sequence of a cDNA and the amino acid 
sequence of the enzyme has not been reported. 

SUMMARY OF THE INVENTION 

55 

An object of the present invention is to provide a purified ent-kaurene synthase B. Another object of the present 
invention is to provide the amino acid sequence of the enzyme and a DNA encoding the enzyme. 

The present inventors succeeded in purifying an ent-kaurene synthase B from endosperm of pumpkin (Cucurbita 
maxima L.) seeds. The inventors revealed its amino acid sequence (a partial sequence), and based on the information 
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of the amino acid sequence, they succeeded in cloning a gene which encodes ent-kaurene synthase B . 

Accordingly, the present invention provides a polypeptide which is characterized by the amino acid sequence as 
defined by SEQ ID NO:1 of the sequence listing (amino acids No 1-789: the amino acid sequence is shown by the 
lUPAC one-letter symbols). The polypeptide can act as an enzyme having the catalytic activity of converting copalyl 
5 pyrophosphate to ent-kaurene (in the specification, this enzyme will be occasionally referred to as ent-kaurene syn- 
thase B , or more symply as KSB). According to another embodiment of the present invention, DMAs encoding the 
above-described polypeptide are provided. As a preferred embodiment thereof, a DNA which is characterized by nucle- 
otide Nos. 138-2507 (including the termination codon) of the nucleic acid sequence as defined by SEQ ID NO:2 of the 
sequence listing is also provided. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a chromatogram profile of the ent-kaurene synthase 8 of the present invention applied to hydrophobic 
interaction chromatography. 

IS Fig. 2 is a chromatogram profile of the ent-kaurene synthase B of the present invention purified by a DEAE column. 
Fig. 3 is a chromatogram of an ent-kaurene synthase B of the present invention purified by a hydroxyapatite col- 
umn. 

Fig. 4 shows activities of an ent-kaurene synthase B of the present invention in the presence of divalent cations. 

20 DETAILED EXPLANATION OF THE INVENTION 

In addition to the aforementioned naturally derived ent-kaurene synthase B from pumpkin, any polypeptides also 
fall within the scope of the polypeptides provided by the present invention which have the amino acid sequence difined 
by SEQ ID NO:1 of the sequence listing together with insertion, deletion, and/or substitution by one or more amino acid 

25 residues and which have the catalytic activity of converting copalyl pyrophosphate to ent-kaurenethe. Particularly pre- 
ferred embodiments of the polypeptide of tiie present invention include naturally-derived ent-kaurene synthases 
expressed in plant bodies such as pumpkin, maize, rice, or soybean. 

Detailed descriptions will be given in the examples of the specification as for the evaluation of the enzymatic func- 
tion of the naturally-derived ent-kaurene synthase B from pumpkin, which is a preferred embodiment of the polypeptide 

30 of the present invention. Accordingly, it can be readily understood that an ordinarily skilled artisan can easily determine 
whether or not a variant polypeptide, having insertion, deletion, and/or sut>stitution of one or more constituent amino 
acid residues of the amino acid sequence of said enzyme, has the desired activity. 

According to another embodiment of the present invention, polypeptides are provided which contain in their mole- 
cules amino acid sequences of the polypeptide of the naturally-derived ent-kaurene synthase B from pumpkin or variant 

35 polypeptides thereof and which have catalytic activities of converting copalyl pyrophosphate to ent-kaurene. These 
polypeptides are characterized by the entire sequences of the above-described polypeptides contained as a part 
thereof. 

According to further embodiments of the present invention, polypeptides having catalytic activities of converting 
copalyl pyrophosphate to ent-kaurene are also provided which consist of, or alternatively comprise as a part thereof, a 

40 partial amino acid sequence of the polypeptide of the naturally-derived ent-kaurene synthase B from pumpkin or its var- 
iant polypeptide wherein said partial sequence has the catalytic activity of converting copalyl pyrophosphate to ent-kau- 
rene, i.e.. being a so-called active domain. Such peptides may consist solely of the active domain, per se, or may 
comprise the active domain as a part thereof. 

For example, the naturally-derived ent-kaurene synthase B obtained from punnpkin. which is a preferred embodi- 

45 ment of the present invention, may occasionally be retained inside the cells after expression. A membrane-binding 
domain (a hydrophobic domain) may be selectively removed from the entire amino acid sequence of the enzyme and 
the remaining polypeptide containing an active domain can be ligated to a signal peptide to produce an extracellularly 
releasable soluble enzyme. For the determination of the hydrophobic domain, methods well known in the art, e.g., the 
method of Kyte et al, (Kyte. J. et al.. J. Mol. Bio.. 157. pp-105-132. 1982), the Hopp & Woods method and the like can 

50 be employed. 

According to further embodiments of tiie present invention. DNAs which encode any one of the above-described 
polypeptides are provided. For example, the present invention encompasses DNAs encoding the aforementioned nat- 
urally-derived ent-kaurene synthase B from pumpkin, and as a preferred example, the DNA characterized by nucle- 
otides of from No. 138 to 2507 (including the termination codon) of the nucleic add sequence as defined by SEQ ID 
55 NO:3 of the sequence listing. 

Furthermore, examples of DNAs that fall within the scope of the present invention include DNAs which encode a 
variant polypeptide having the catalytic activity of converting copalyl pyrophosphate to ent-kaurene wherein said 
polypeptide includes insertion, deletion, and/or substitution of one or more of the constituent amino add residues of the 
amino acid sequence of tiie naturally-derived ent-kaurene synthase B from pumpkin; and DNAs which encode a 
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polypeptide having the catalytic activity of converting copalyl pyrophosphate to ent-kaurene wherein said DNA has a 
DNA sequence characterized by the nucleic acids of from 138 to 2507 (including termination codon) of the nucleic acid 
sequence as defined by SEQ ID NO:3 of the sequence listing in which one or more of the constituent nucleotides are 
inserted, deleted, and/or substituted. 

5 DNAs which act as antisense DNAs against the gene containing the aforementioned sense DMAs also fall within 

the scope of the present invention. One example of such antisense DNAs is a DNA which acts as an antisense DNA 
against an ent-kaurene synthase B gene comprising a DNA encoding any one of the aforementioned polypeptides 
wherein said DNA sequence includes insertion, deletion, or substitution of one or more of the constituting nucleotides 
of the sequence that is complimentary to the DNA characterized by nucleotides of from No. 138 to 2507 (including the 

10 termination codon) of the nucleic acid sequence defined as SEQ ID NO:3 of the sequence listing. 

As for the DNA encoding the naturally-derived ent-kaurene synthase B from pumpkin, a preferred example of the 
DNAs of the present invention, method for the isolation thereof will be detailed In the examples of the specification. 
Therefore. DNAs encoding ent-kaurene synthase Bs derived from pumpkin and other plants can be readily obtained by 
those skilled in the art by following the methods described in the examples, or alternatively, by referring to the descrip- 

15 tion and optionally applying appropriate modifications or alterations to the described methods. Furthermore, it should 
be understood that DNAs containing any one of the aforementioned DNAs as a part thereof also faW within the scope 
of the present invention. 

The DNAs of the present invention and antisense DNAs against said DNAs are useful fbr the regulation of the 
expression of ent-kaurene synthase B in plant that has important functions in the growths of plants and seeds. For 
20 example, a transgenic plant introduced with the DNA of the present invention may have an increased height, and its pro- 
moted growth can be expected. A transgenic plant Introduced with the antisense DNA may have a reduced hight while 
the quantities of seeds and fruits can be maintained. Therefore, a plant bearable against a damage from a storm and 
flood can be obtained. 

The present invention also provides a recombinant vector containing the aforementioned DNA and a transformant 
25 cell containing the recombinant vector. Although the types of the vectors are not particularly limited so long as available 
to those skilled in the art, an appropriate vector should be chosen depending on a combination with a host cell to be 
transformed. Sorts of the host cell are also not particularly limited and any cells such as microbial cells (prokaryote), 
e.g.. E. coli and animal or plant cells (eukaryote) can be used. 

30 EXAMPLES 

The present invention will be more specifically explained by referring to the examples. However, the scope of the 
present invention is not limited to the examples set out below. 

35 Example 1 : Purification of the ent-kaurene synthase B (KSB) 

(1) Materials and methods 

(A) Plant materials and chemicals 

40 

Seeds of pumpkin (Cucurbita maxima L) cv. "Riesenmelone gelb vernetzt" were obtained from van Waveren Rlan- 
zenzucht, Rosdorf. Germany through Professor Jan Graebe. Plants were cultivated in a field in Saitama. Japan, and 
immature fruits were harvested between the middle of June and early July in 1992, when the cotyledons had reached 
about half their final length 

45 Geranylgeraniol was a gift from Dr. T. Takigawa of Kurare Co. Ltd. (KurashiW. Japan). Copalic acid (Nakano. T and 
Djerassi C, J. Org. Chem., 26, pp. 167-1 73. 1961) and natural resin "Brazil copal" (which contains copalic acid) were 
gifts from Prof. T. Nakano of Venezuela National Institute of Science (Caracas, Venezuela) and Prof. Y. Ichinohe and Dr. 
H. Sakamaki of Nihon University, respectively. [^H]Sodium borohydride (296 GBq mmol*^) was purchased from Amer- 
sham Int. (Buckinghamshire, UK). [1 -^H]geranylgeraniol (74 GBq mmol'^) and [15-^H]copalol (74 GBq mmol"^) were 

50 prepared by reduction of geranylgeranlal and copalal with [^H]sodium borohydride (Bensen, R.J. and Zeevaart, J.A., J. 
Plant Growth Regul., 9. pp.237-242, 1990). . 

Tritium labeled geranylgeranyl pyrophosphate (GGPP) and copalyl pyrophosphate (CPP) were prepared through 
chlorination and subsequent pyrophosphorylation (Davisson et al.. J. Org. Chem.. 51 , pp.4768-4779. 1986). The pyro- 
phosphates were purified by reverse-phase chromatography on a Bond-Elut CIS cartridge (1 mL. Varian, Harbor City, 

55 CA) with stepwise elution using 20, 30, 50 and 70% (v/v) aqueous methanol. Optically pure uniconazole (Izumi, K., et 
al.. Plant Cell Physiol.. 26. pp.821-827, 1985) was obtained from Sumitomo Chemical Co. 
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(B) Enzyme assay and protein assay 

KSB activity was measured in the incubation mixtures consisted of enzyme solution in Kpi buffer (50 mM« pH 8.0. 
100 \i L) containing glycerol (10%, v/v), dithlothreitol (DTT) (2 mlVI), MgCl2 (5 mM), uniconazole (20 \i M, Izumi, K., et 

5 aL. Plant Cell Physiol.. 26. pp.821 -827. 1985) and [IS-^HJCPP (1.0 kBq). [1-^H]GGPP (1.0 kBq) was used as a sub- 
strate for the measurement of KSAB activity (conversion from geranylgeranyl pyrophosphate (GGPP) to ent-kaurene). 

The mixture was Incubated at 30 *C for 30 min. and the enzyme reaction was terminated by the addition of acetone 
(200 \i L) and water (100 ji L). The mixture was extracted with n-hexane (400 \l L), and 300 \i L aliquots of the extract 
were concentrated in vacuo and applied to a silica gel plate. After developing with n-hexane. the radioactivity of the sW- 

10 ica gel of the ent-kaurene region (Rf 0.6-1 .0) was determined. One unit of kaurene synthase activity was defined as the 
production of 1 pmol of ent-kaurene per minute. The protein concentration was determined using BIO-RAD (RichnfK>nd, 
CA) microassay (Bradford. M.M.. Anal. Biochem.. 72. pp.248-245. 1976) using BSA as the standard. 

(C) Purification of KSB 



All procedures were performed at 0 to 40°C. The following t>uffers were used: buffer A, 50 mM KPi buffer (pH 8.0) 
containing 2 mM DTT; buffer B. buffer A containing 10% glycerol; buffer C, 20 mM Tris-HCI buffer (pH 7.4) containing 
10% glycerol and 2 mM DTT; buffer D, 1 mM KPI buffer (pH 6.0) containing 20% glycerol. 2 mM DTT and 5 mM MgC^. 
Crude enzyme extract (1 L) was prepared by the method of Graebe et al. (Graebe. J.E., et al.. Pytochem.. 13, 

20 pp.1433-1440. 1974). Butyl-Toyopeart 650S gel (Toso, Tokyo. Japan) was packed into a glass column (AP-1 , 10 mm X 
100 mm, Waters. Millipore Corp.. Mitford. MA) and the column was equilibrated with a buffer A containing 1.7 M 
(NH4)2S04. Ammonium sulfate was added to the crude enzyme preparation (77 mL) to 1 .7 M of a final concentration, 
and stirred for 1 0 min. The enzyme solution was centrrfuged at 2.000 g for 10 min, and the supernatant was loaded onto 
the column at a flow rate of 3.0 mUmin. The column was eluted with a 30-min linear gradient of 1 .7 to 0 M (NIH4)2S04, 

2S and then with buffer B for 45 min. The fraction volume was 6 mL and a 5 p. L aliquot of each fraction .was assayed. 

Tlie major active fractions (No. 32-37) were concentrated by ultrafiltration (YM-10 filter, Amicon. Beverly, MA) and 
the active fractions from several runs were pooled (53 mL). The butyl-purified enzyme preparation equilibrated with 
buffer C (8.4 mL) was loaded onto a DEAE-8HR column (10 mm X 1 90 mm, Waters). The column was eluted with a 50- 
min linear gradient of 0 to 500 mM NaCI in buffer C. The flow rate was 1 .0 mL/min and the fraction volume was 2.0 mL. 

30 The active fractions (No. 33-38) were pooled and concentrated by ultrafiltration (YM-10). Ammonium sulfate was 
added to the DEAE-purrfied enzyme preparation (7.9 mL) to 1 .7 M of a final concentration. The enzyme solution was 
loaded onto a TSK phenyl-5PW column (7.5 mm X 75 mm, Toso), which was equilibrated with buffer B containing 1.7 
M (NH4)2S04. The column was eluted with a 60-min linear gradient of 1 .7 to 0 M (NH4)2S04. and then with buffer B for 
30 min. The flow rate was 1 .0 mL/min and 2.0 mL fractions were collected. The major KSB activity was found in fractions 

35 No. 37-41 . For the second DEAE ion exchange chromatography, the phenyl-purrfied enzyme preparation (2.0 mL) was 
charged to the DEAE column and eluted as described above. 

The fractions containing most of the KSB activity (No. 36-38) were concentrated to 300 L by ultrafiltration (Centi- 
con 30, Amicon). This preparation was charged to a hydroxyapatite column (TON EN TAPS-050805 HG, 8.0 mm X 50 
mm, Tonen, Tokyo, Japan) equilibrated with buffer D. The column was eluted with a 60-min linear gradient of 1 to 500 

40 mM phosphate. TTie flow rate was 1 .0 mUmin and the fraction volume was 1 mL. The major KSB active fractions (No. 
23-25) were concentrated by ultrafiltration (Centricon 30) and stored at -80°C (70 g . 673 units). 

(D) Gel permeation chromatography 

45 The phenyl-purified enzyme preparation was charged onto the gel permeation column (TSK G3000SWx1 . 7.8 mm 
x 300 mm, Toso. Tokyo, Japan) which was pre-equilibrated with buffer B containing 200 mM NaCI. The column was 
eluted with the same buffer at a flow rate of 0.4 mL/min and fractions were collected every 30 sec. 



Active enzyme fractions of the sequential purification steps were analyzed by SDS-PAGE using a 7.5% (w/v) gel 
(Laemmli, U.K.. Nature. 227. pp.680-685. 1970). Approximately 3 |li g of total protein was applied to each lane. After 
electrophoresis, proteins were visualized by Coomassie Brilliant Blue R250 (CBB) staining. Ovalbumin (45.0 kD). BSA 
(66,2 kD), phosphorylase B (97.4 kD), B-galactosidase (1 16 kD), and myosin (200 kD) were used as standard proteins 
55 (BIO-RAD). 

(F) Properties of KSB 

When the properties of KSB were examined, the phenyl-purified enzyme preparation (0.47 n g of total protein) was 
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(E) SDS-PAGE 
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used as the enzyme solution. The amount of [1-^H]CPP was 2.2 kBg and the incubation period was 15 min. The other 
conditions were the same as those for the enzyme assay. To determine the optimal pH, KPi buffer (50 mM) adjusted to 
a particular pH (pH 4.5 to 9.0) was used. To determine the K^^ value, the phenyl-purif ied enzyme preparation (1 .2 g 
of total protein) was used, and the concentrations of [1 -^H]CPP were in the range of 7 to 480 nM. The production of ent- 
5 kaurene was calculated and the value was determined from a Uneweaver-Burk plot. 

(G) N-terminal amino-add and internal peptide sequence analysis 

The final purified enzyme preparation (14.5 \l g ) was purified on a 7.5% (w/v) polyacrylamide gel. Proteins were 
10 electroblotted to a polyvinylidene difluoride membrane and visualized (Matsudaira. P.. J. Biol. Chem., 262. pp. 10035- 
10038), 1987). The 81 kD band on the membrane was cut out and the amino-add sequence of the protein was ana- 
lyzed by a protein sequencer (477A protein sequencer. Applied Biosystems Inc.. Foster City. CA). 

For internal peptide sequence analysis, the 81 kD protein was digested in situ (Kawasaki. H., etal., Anai. Biochem., 
191 , pp.332-336, 1 990) with the Acromobacter protease t (a gift from Dr. T. Masaki of Ibaraki University). The generated 
IS peptide fragments were extracted from the gel and separated by reverse phase HPLC using a Superspher RP-Select 
B column. Aqueous 0.09% (vA^) TFA and 80% (v/v) acetonitrile containing 0.075% (v/v) TFA were used as eluents A 
and B, respectively. The flow rate was 0.2 mUmin and a 32-min linear gradient of 0 to 80% of eluent B was used (1090M 
HPLC system. Hewlett Packard, Avondale. PA). Absorbances at 215. 254. 275 and 290 nm were monitored. Amino acid 
sequences of separated peptides were analyzed with the protein sequencer. 

20 

(H) Polyclonal antibodies for the 81 kD protein 

A major peptide fragment of 81 kD protein, ASQIITHPDESVLENINSW T (shown in the single character code), was 
custom synthesized by Quality Controlled Biochemicals Inc. (Hopkinton, MA). The synthetic peptide (3.1 mg, 1.4 nmol) 
25 was combined to keyhole limpet hemocyanin by using an Imjet Immunogen EDC Conjugation Kit (Pierce. Rockfbrd, IL) 
according to the manufacturer's instructions. The combined protein (1 mg).was injected into a rabbit three times with 
inten^als of 2 weeks, and 4 weeks after the last injection antiserum was collected by centrifugation of the blood. Immu- 
nogloblin G was purified from the antiserum using a HiTrap Protein G column (1 mL, Pharmada LKB biotechnology, 
Uppsala. Sweden). 

30 About 10 ^ L of each fraction of the 2nd DEAE-chromatography was resolved on two 7.5% SDS-PAGE gels. One 
of the gels was stained with CBB solution, and proteins on the other gel were electroblotted onto a nitrocellulose mem- 
brane. The nitrocellulose membrane was treated with the IgG at room temperature for 2 hr and then with alkaline phos- 
phatase conjugated anti-rabbit- IgG at the same temperature for another 2 hr. The membrane was treated with nitro-blue 
tetrazolium chloride and 5-bromo-4-chloro-3'-indolylphosphate (Pierce) to visualize proteins. 

35 

(2) Results 

(A) Purification of KSB 

40 The crude enzyme prepared from immature seeds was first purified by a hydrophobic interaction chromatography 
in order to remove metal ions and GAs. and to concentrate the enzyme. The majority of the KSB activity was eluted at 
an (NH4)2S04 concentration of 0.6-0.1 M. whereas the majority of the KSAB ativity was eluted after the (NH4)2S04 gra- 
dient was complete (Fig. 1). Fractions corresponding to an (NH4)2S04 concentration of 0.6-0. 1 M were collected as the 
butyl-purified KSB preparation, although this preparation contained some KSA activity. 

45 The butyl-purified KSB preparation was purified further by DEAE ion exchange chromatography with a NaCI gradi- 
ent. Both KSA and KSB co-eluted at fractions corresponding to 240-360 mM NaCI. Separation of KSA and KSB was 
not achieved by DEAE chromatography but the specific activity of KSB was increased (Table 1). These fractions were 
collected and were further purified by high performance hydrophobic interaction chromatography using a Phenyl-5PW 
column. The majority of KSB activity eluted at (NH4)2S04 concentrations of 0.5-0.2 M, and KSAB activity eluted later. 

50 About 40% of the phenyl-purif ied enzyme preparation was further purified on the same DEAE column. The main 
peak of KSB activity was detected in fractions of 250-280 mM NaCI (Rg. 2), which were collected as the second DEAE- 
purified enzyme preparation. This enzyme preparation contained four major proteins by SDS-PAGE. and was further 
purified on a hydroxyapatite column. The KSB ativity was eluted at phosphate concentrations of 90-140 mM. and the 
peak shape of KSB activity was corresponded with that of UV-absortance at 280 nm (Fig. 3). The purest KSB prepa- 

55 ration did not show KSAB activity suggesting that KSA was not present in the preparation. ent-Kaurene synthase B was 
purified 291 -fold from endosperm of Cucurbita maxima L (Table 1) and 70 ^ g of purified enzyme was obtained. 
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Table 1 



Enzyme prepara- 
tion 


Total protein (nng) 


Total activity 
(units) 


Yield (%) 


Specific activity 
(units/mg) 


Purification (fold) 


Crude enzyme 


530 


17,500 


100 


33.0 


1 


Butyl-purified 


168 


11,000 


63.8 


65.4 


1.98 


1st DEAE-purlfied 


13.9 


5,170 


.29.6 


372 


11.3 


Phenyl -purified 


4.43 


3.340 


19.2 


754 


22.8 


2nd DEAE-purified 


0.407 


2,820 


16.2 


6.930 


210 


HA-purified 


0.070 


673 


3.87 


9.610 


291 
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(B) SDS-PAGE analysis and molecular mass estimation 

20 

The hydroxyapatite-purified enzyme preparation showed a major band at apparent molecular mass of 81 kD. The 
amount of protein corresponding to the 81 kD band estimated by SDS-PAGE in each fraction of the second DEAE chro- 
matography correlated with KSB activity. To determine the molecular mass of KSB by means of gel permeation HPLC, 
we used a TSK G3000 SWxl column. On this column, the etution profile of KSB was broad but centered on retention 
25 time of 23.5 min, which corresponds to an apparerrt molecular mass of approximately 90 kD. 

(C) Properties of KSB 

The properties of KSB were examined with the phenyl-purif ied enzyme preparation. The optimal pH for KSB was 
30 6.8-7.5 in 50mM KPi buffer. The K^ value for [1 5-^H]CPP was 0.35 [i M. In the presence of 5 mM Mg^* KSB could con- 
vert CPP to ent-kaurene, but was not active without exogenous cations. Since EDTA (1-100 mM) did not change KSB 
activity, the phenyl-purif ied enzyme preparation did not contain endogenous cations that affected KSB activity. 

Some divalent cations (0.1-50mM. as chlorides) were added to the incubation mixture instead of 5 mM Mg^*^, and 
KSB activity was measured. As shown in Fig. 4. Mg^***. 00^*** and Mn^^ promoted KSB activity and their optimal concen- 
ts trations were around 10 mM. Ni^*^ and Fe^*^ weakly promoted KSB activity, while Cu^"^ , Ca^"*^ and Ba^^ showed no 
effects. When Cu^* . Ca^* and Ba^* were added to the incubation mixture in combination with 5 mM Mg^*. all three 
inhibited KSB activity promoted by Mg^*^. In particular Cu^*^ completely inhibited the conversion promoted by 5 mM 
Mg^*^ at concentrations above 0.5 mM. 

40 (D) Peptide sequence analysis 

The N-terminal peptide sequence of the 81 kD protein from SDS-PAGE was analyzed and the N-terminus of the 
protein was fburxj to be blocked. The 81 kD protein band was cut out from the polyacrylamide gel and treated with a 
lysyl endopeptidase. Acromobacter protease I (Masaki, T. et al., Biochem. Biophys. Acta, 660, pp.44-50, 1981). The 
45 resulting peptides were separated by reverse-phase HPLC and the sequences of the major peptides analyzed (Kawa- 
saki, H., et al., Anal. Biochem., 191, pp.332-336, 1990). Sequences and approximate amounts of the peptides recov- 
ered are (1 ) ASQIITHPDESVLENINSWT (74 pmol). (2) EAEWSTNK (45 pmol). (3) RAMESYSGDIVRISK (37 pmo\), (4) 
HGLSSDSVW (24 pmol), (5) LQDWSMVMQYQRK (17 pmol). 

so (E) Western blot analysis and immunopredpitation 

Western blot analysis showed that the polyclonal antibody raised against the synthetic peptide ASQIITHPDESV- 
LENINSWT. combined to keyhole limpet hemocyanin, selectively recognized the 81 kD protein in each fraction from the 
2nd DEAE-chromatography, and the pattern showed a good correlation with the patterns of KSB activity and CBB stain- 
55 ing. In an immunoprecipitation experiment using the phenyl-purified enzyme and the polyclonal antibody, the 81 kD pro- 
tein was detected in the supernatant fraction and not in the precipitate regardless of the amount of antibody used. 
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Example 2: Cloning of ent-kaurene synthase gene 

(A) Isolation of the KSB cDNA done 

5 From the peptide sequences ([1] AS QIITHPD ESVLENINSWT, (2) EAEWSTNK, [3] RAMESYSGDIVRISK. [4] 

HGLSSDSVW. [51 LQDWS MVMQYQ RK). degenerated oligonucleotides were synthesized for PGR primers in both 
sense and antisense directions. Using these synthetic oligonucleotides as primers, (RT)-PCR was tried to amplify 
cDNA fragments from poly(A) * RNA prepared from immature cotyledons of Cucurbita maxima. 

As the result, a 472-bp cDNA fragment was amplified when a combination of the fbiward primer designed from the 

10 underiined part of the sequence [5] and the reverse primer from the underiined sequence in [1] was used as a primer 
set. The amino acid sequence showed relatively high homology with KSA of Arabidopsis (39% identity, 66% similarity). 
Because the amino acid sequence of this region of putative maize KSA (Bensen. R.J. et al., Plant Cell, 7. pp. 75-84. 
1995) has much higher homology with Arabidopsis KSA, it was suggested that the PGR amplified fragment was not a 
part of KSA, but that of one of other terpene cyclases. 

15 Using the PCR-amplified fragment as a probe. RNA blot analysis was performed against poty(A)'*^RNA prepared 
from the same material to detect a 2.7-kb transcript a possible size compared to the estimated size of the purified pro- 
tein (81 kD). Considering these results together with the fact that the final purified enzyme fraction (the hydroxyapatite 
purified fraction as described above) used for sequencing did not contain KSA activity but had KSB activity, it is strongly 
suggested that the 472-bp PCR-amplif ied fragment encodes a part of KSB cDNA. 

20 To obtain a full-length KSB cDNA done, a cDNA library consisting of 1.2 million irxlependent clones was con- 
structed from poly(A)* RNA prepared from immature cotyledons of Cucurbita maxima using X ZAP 11 as a vector. After 
the amplification of the library, 5x10^ clones were screened to isolate 9 independent clones after 3 rounds of purifica- 
tions. After the subcloning into pBluescriptSK(-), the size of the insert cDNA was determined to select 2 clones, pKB3 
and pKB16 for further studies. Both of the clones contained 2.4-kb insert and showed same restriction enzyme maps. 

25 Comparison of the size of the cDNA insert and that of the transcript detected in Northern blot (2.7 kb) implied that 
the inserts of pKB3 and pKBIS are S'-truncated DMAs. The method of rapid amplification of cDNA ends was used for 
the isolation of the 5*-region of pKBIS, and a cDNA fragment containing an additional 263 bp in 5'-region was 
Isolated, which was fused to pKBIS by use of PstI site locating at nudeotide 722. The resulting plasmid clone contain- 
ing the fused cDNA was named pKB20. 

30 

(B) Sequencing analysis of pKB20 

pHB 20 contains a cDNA insert consisting of 2658 nucleotides, which contains the identical sequence (801 to 
1269) to the PGR amplified fragment which was used as a probe. The nucleotide sequence of the full length cDNA is 

35 given in SEQ ID NO:2 of the sequence listing. The KSB code region of the cDNA and the amino acid sequence of the 
corresponding KSB are given in SEQ ID NO:3 of the sequence listing. The amino acid sequence of KSB is shown in 
SEQ ID NO:1 of the sequence listing according to the lUPAC single character coding. The ATG codon at the position of 
138 to 140 is most likely the starting codon for KSB protein, which is followed by a long ORF of 2367 nucleotides and 
polyadenylation signal sequence (AATAAA) at the position of 103 to 109 nucleotides downstream from the translational 

40 stop codon. 

The ORF consists of 789 amino acids, where all the peptide sequences obtained from the protease-digested puri- . 
fied protein are detected, although there was a change of amino acid in [5] LQDWSMVMQYQRK to LQD WDM- 
VMQYQRK. The first 50 N-terminal amino acid sequence is rich in serine (1 8%) and threonine (1 2%) and the estimated 
pi of this region Js 9.8, whjch are common characters of transit peptides targeting chloroplasts (Keegstra, et al., Annu. 

45 Rev. Plant Physiol. Mol. Biol.. 40, 471 -501 , 1 989). 

As shown in above example 1, the polyclonal antibody raised against the synthetic oligopeptide, [1] ASQIITHP- 
DESVLENINSWT, combined to a carrier protein did not recognize the native KSB protein, although it recognized a 81 
kD protein on SDS-PAGE. From the results, it was predicted that the peptide sequence located not on the surface, but 
in the core part of the properly folded protein, so that the antibody could recognize only the denatured protein. In the 

50 sequence of above, the peptide sequence used for the production of the antibody is found in just the middle of the 
sequence (nucleotide 1244 to 1304). not in the N-terminus nor C-terminus region, which are supporting the experimen- 
tal results of Example 1 . 

(c) Expression of KB20 protein in Escherichia ooli 

55 

To confirm whether pKB20 encodes KSB, the protein encoded by the cloned cDNA was over-expressed as a fusion 
protein with maltose binding protein (Guan, et al.. Gene. 67, 21-30, 1987) in Escherichia coli to analyze the enzyme 
activity of the expressed protein. Plasmid pMAL-KB containing a fragment (tac pronx>ter sequence-malE-KSB) for the 
expression of the fusion protein (MBP-KB) and the control plasmid pMAL (42 kD for MBD expression) containing tac- 
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promoter sequence-malE fragment were prepared. 

In this system, the expression of the fusion protein is under the regulation of tac promoter, which is induced by the 
addition of isopropyl-b-D-thiogalactopyranoside (IPTG). On an SOS-PAGE followed by Coomassie blue staining, a 
band of an expected size (131 kD) of the fusion protein (MBP-KB) was delected in total protein extract of E. coll which 
5 carries pMALKB after IPTG induction. Immunoblot analysis was performed to confirm whether the IPTG-lnduced pro- 
tein is recognized by the anybody against the synthetic peptide. Immuno-reactivity was detected on the band corre- 
sponding to the IPTG-lnduced protein, and additional bands of smaller size were also detected, which seemed to be 
the degradation products of the KB20 protein. However, no immuno-staining was observed in the protein of E. coll pro- 
ducing MBP only, regardless of the addition of IPTG. 

10 

(D) Functional analysis of KB20 protein 

To analyze the enzyme activity of MBP-KB. soluble proteins were prepared from E. coli producing the fusion pro- 
tein. But only a trace amount of the fusion protein was found In the soluble fraction, suggesting that the most of the 

IS fusion protein formed an inclusion body, when the E. coli was cultured under normal conditions. Bishal et al. (Bishal, et 
al., J. Bact. 169, pp.51 40-51 51 , 1987) reported that when E. coll is cultured at a low temperature after the induction by 
IPTG, the ratio of the over-expressed fusion protein in a fraction of soluble proteins increased. In the case of MBP-KB, 
at 20''C, the ratio of MBP-KB protein in soluble fraction raised to atx>ut 10 folds compared to E. coli cultured at 37 ""C. 
The tennperaturefits a physiological condition of the plant which Is the origin of the enzyme, thus Is seemed to help the 

20 stability of the enzyme to culture E. coll at a low temperature (20''C). 

The conversion to ent-kaurene was analyzed using the soluble protein extract of E. coll grown at 20 ""C after the 
addition of IPTG. according to the KSB enzyme assay as the above Example 1 and adding [^H]CPP to the extracts of 
E. coli which carries pMALKB or pMAL After the incubation of [^H]CPP with tiie E. coli extracts in the presence of Mg2+ 
(cofactor for KSB of pumpkin) and uniconazole (an inhibitor of ent-kaurene oxidation), the reaction mixture was 

25 extracted with n-hexane followed by fractionation on silica-gel thin-layer chromatography (TLC) to isolate LP (low polar- 
ity) fraction, where ent-kaurene is fractionated. 

KSB activity was tentatively defined as the radioactivity (dpm) of the "LP fraction". The extract of E. coli producing 
MBP-KB20 showed significant KSB activities, athough that of the bacteria cells expressing only MBP had no activities. 
It was analyzed on OSD-HPLC to confirm whether the radioactivity in "LP fraction" on TLC corresponds to ent-kaurene 

30 The retention time of the major peak In LP fraction of the E. coll extract was Identical to tiiat of the original pumpkin 
enzyme preparation, besides, those of them are identical to retention time of authentic [^"^CJent-kaurene. It was also 
proved that the radioactivity in LP fraction of the E. coli extracts were mostiy derived from the major peak (95%) identical 
to ent-kaurene fraction. Thus, it was Indicated that the extract of E. coli producing MBP-KB had enzyme activity to con- 
vert CPP to ent-kaurene. 

35 According to the results of Northern blotting, KSB transcripts were detected in all of the tested tissues and this fact 
indicates that the KSB could be expressed not only in immature seeds but in vegetative tissues. On the other hand, a 
message of 20-oxidase of pumpkin is detected only in Immature seeds in contrast to KSB (Lange. T. et al., Proc. Nati. 
Acad. Scl. USA, 91 . pp.8552-8556. 1 994). The expression level of KSB is relatively high in young tissues and the tran- 
scripts were detected most abundantiy in growing cotyledon on growth of immature seeds. 

40 

(E) Sequence comparison with other terpene cyclases 

The deduced amino acid sequence of KB20 and sequences of other terpene cyclases, including KSAs, was ana- 
lyzed with comparison to each other. KB20 sequence shared homology with tobacco sesquiterpene cyclase (Facchini, 

45 P.J., Proc. Nati. Acad. Scl. USA, 89. pp. 11 088-1 1092. 1992), limonene synthase (Colby. S.M., J. Biol. Chem.. 268. 
pp.2301 6-23024. 1993). caster bean casbene syntiiase (Colby. S.M.. J. Biol. Chem., 268. pp.2301 6-23024, 1993) and 
in particular. Arabldopsis KSA (Sun TP. and Kamiya. Y, Plant Cell, 6, pp.1509-1518, 1994). 

Around the center part of the sequences (amino acid 337 to 600) are well conserved In all the terpene cyclases 
analyzed (Sun TR and Kamiya. Y, Plant Cell. 6. pp.1509-1518. 1994). Arabldopsis KSA and KSB20 shared the iden- 

50 tical amino acids in the first part of the sequences (amino acid 1 14 to 230. 34.8% identity), although common amino 
acid between the KSA and the KSB are hardly observed in the last part of the sequences (amino acid 601 to tiie termi- 
nus). However, KSB and cyclases other than KSA shared some conserved amino acid in the region of amino acid 603 
to 631. 

55 



9 



EP 0 768 381 A2 



SBQOENCS LISTING 

(1) GEISBRAL XSIFORMATXON ; 

(i) APPLICANT: INSTITUTE OF PHYSICAL AND CHEMICAL RESEARCH 

(ii) TITLE OP INVEUTION: KAURENE SYNTHASE 
(ill) NUMBER OP SBQUENCES: 3 

<2) INFORMATION FOR SEQ ID NO:l: 
(i) SEQUBHCB CHARACTERISTICS: 

(A) I^GTH: 789 amino acids 

(B) TYPEt amino acid 

(C) TOPOLOCT: linear 
<ii) MOLBCOLE TYPE: protein 

(iii) SBQOBMCE DESCRIPTION sSEO ID N0:1: 
im.SRPTGVARPAASSSSSSSASLFP6VDVDriTKTGALHFEETKERIKKIfD 
AYDTA)(^AMVPSPNSLNQPLFPBCINWVIJ3S0HADGSS9GLLHNDQU^^ 
TIja(WNlGHDHHSlUaJ3PIK5NIASATD£NQRSPVGFDZZFPGMIBYARDI.^^ 
>3VDALVRKKELEIASCRSNSBG6KAYIAYVSBGIGRIODWI)^^ 

AAAFMKBNDDGCPDYUiSLIiaKFDGSVPTIYPIJ>IYARLHHVDSIjQKFGIARBFKEEI^ 
\n;4DFrYRCWMQGBBNIFrJ3ASTCAMAFRMLRVBGyDVSSDQLTQF 

E^SI£LYKASQlITHPDESVLENINSWTSRPLKHGLSSDSVWSDRTDSVVKQEAVt2ALE 
FPYNATI^RLISKRAMESYSGDIVRISKSPYACLNFGHQDFLEIAVEDFNTLQRIHIiKEL 
EELQRWWENKLDELKPPRLHIiGy CYFAAAATLTDPELHDARI AWAQNGVLTTWDDFYD 
GGGSBEEU3NLIELVEKWDPDGEVGYCSKDVEIVFIJVIJiSTVCEIGRRALVW0GRS\^^ 
\n:DGWLAt«IJCVMRKEABMSTNKVVPSI^EYMEQAHVSFAI^PIILPha^ 
GSCEYQKLYKLI4STAGRLKNDIRSYDRECKEGKLNILSLWMIDGGGNVTKEEAIEAI KGD 
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FERAIRELLGLVLQENTTIPRACKOLFWKLMSIVNLFYMEDDGYTSNRLMNT^ 
MDLDALLNK 789 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26S8 base pairs 

(B) TYPE; nucleic acid 

(C) TOPOLOGYt linear 
(ii) MOLECULE TYPE: CDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GAAAGCGGTA 


GTTTTCGGCT 


CTTGGCTGTT 


TTTGTGTACT 


GCCTATTCTG 


TTCTTTATAT 


GCACTGAATG 


TGAAAATGCC 


TCACCAGAGT 


TTGATGCTGC 


TTCCGCAATT 


TCTTTGCATG 


TCCTAAACAT 


ACCAGTCATG 


TATCTTTCCC 


GACCTACCGG 


CGTTGCCCGT 


TTTGCTGCCT 


CCTCCTCCTC 


CTCCTCCTCA 


GCTTCCCTAT 


TCCCTGGAGT 


GGATGTGGAC 


ACAACAACTA 


AAACTGGAGC 


TTTGCACTTT 


GAAGAAACAA 


AA6AAAGAAT 


TAAGAAATTG 


TTCGACAAG6 


TTGAACTTTC 


AGTTTCTGCA 


TATGATACTG 


CATGGGTGGC 


AATGGTTCCT 


TCTCCAAACT 


CTCTCAACCA 


ACCTCTTTTC 


CCCGAGTGTA 


TAAACTGGGT 


ATTAGATAGT 


CAACATGCTG 


ATGGCTCATG 


GGGCCTACTC 


CACAACGATC 


AGTTGCTGAT 


GAAGGCCAAT 


CTCTTAXCTA 


CATTAGCATG 


TGTTCTTACT 


CTTAAGCGGT 


GGAATATTGG 


GCATGATCAT 


ATGAGCAAGG 


CCCTTGATTT 


TATCAAGTCT 


AATATAGCTT 


CAGCTACTGA 


TGAGAACCAA 


CGTTCTCCGG 


TGGGATTTGA 


CATTATtTTC CCTGGCATGA 


TTGAGTATGC 


TAAAGACTTG 


AATTTGAATC 


TACCCTTGGC 


ACCGACGAAC GTGGATGCCT 


TGGTTCGAAA 


GAAAGAGTTG 


GAGCTGAGAA 


GCTGCAGAAG 


CAACTCTGAA 


GGTGGAAAAG 


CCTATTTAGC 


GTATGTTTCA 


GAAGGAATTG 


GAAAGTTACA 


GGACTGGGAT 


ATGGTCATGC 


AATATCAAAG 


GAAGAATGGA 


TCACTGTTTA 


ATTCTCCATC 


CACTACGGCA 


GCGGCTTTTA 


TGCATAGAAA 


TGATGATGGC 


TGTTTTGATT 
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ATCTTCGCTC ACTCTTACAA AAGTTTGATG 
TATATGCTCG ATTACACATG GTTGATAGCC 
AAGAGGAGAT TAGAAGCGTA TTAGATGAAA 
ATATATTCTT AGATGCTTCA ACTTGTGCAA 
ATGATGTTTC TTCAGACCAG TTGACTCAAT 
GAGGATATTT AAAAGACTTC GGTGCCTCGC 
CGCACCCCGA TGAATCTGTT CTGGAAAATA 
AXGGATTATC TAGTGATTCA GTTTGGTCTG 
CTGTTAATGC TCTTGAGTTC CCCTATAATG 
CAATGGAAAG TTACAGTGGA GACATTGTGA 
ATTTTGGCCA TCAAGATTTT CTGGAACTTG 
TTCATCTTAA AGAACTGGAA GAGCTTCAAA 
TGAAATTTTT CAGACTGCAC CTAGGGTACT 
ATCCTGAACT TCATGATGCT CGCATAGCAT 
TTGATGATTT CTATGATGGT GGAGGATCTG 
TGGAAAAGTG GGATCCTGAT GGGGAAGTGG 
TTCTTGCACT GCACAGCACA GTTTGTGAAA 
GCAGTGTTAT GAGGAATGTT ATCGATGGTT 
AAGCTGAATG GTCGACAAAT AAGGTAGTGC 
ATGTATCATT CGCGTTGGGA CCTATAATCC 
TCTCAGAGGA. AATGATTGGA AGCTGTGAAT 
CTGGTCGCCT TAAGAATGAT ATTCGATCTT 
ATATTCTGTC TCTGTGGATG ATTGATGGCG 
AAGCAATTAA AGGGGATTTT GAGAGGGCGA 
AGAACACTAC AATTCCAAGA GCTTGTAAGG 
ATCTATTTTA CATGGAAGAT GATGGGTACA 



GCTCAGTTCC CACAATATAT CCTCTTGATA 
TTCAAAAATT CGGAATTGCT CGGCATTTCA 
CTTACAGGTG TTGGATGCAA GGAGAGGAAA 
TGGCCTTTCG AATGTtACGT GTTGAAGGAT 
TTTCAGAAGA TATCTTTCCC AATTGCCTTG 
TGGAGTTATA TAAGGCCTCT CAGATTATCA 
TAAACTCTTG GACTAGTCGT TTCCTGAAGC 
ATAGAACCGA TAGTGTTGTT AAACAAGAGG 
CAACTCTAGA ACGCCTAATA AGTAAGAGGG 
GGATTTCAAA ATCGCCATAT GCCTGCTTAA 
CTGTAGAGGA TTTCAATACC CTGCAACGCA 
GATGGGTGGT TGAAAACAAA TTGGACGAGT 
GCTATTTTGC TGCGGCAGCG ACCCTTACTG 
GGGCACAAAA TGGTGTGCTC ACGACCGTGG 
AAGAGGAATT GGATAACCTT ATAGAATTGG 
GTTACTGTTC CAAGGACGTT GAGATTGTAT 
TAGGAAGAAG AGCTTTAGTA TGGCAAGGAC 
GGTTGGCTCT GCTGAAGGTG ATGAGAAAGG 
catcaatggg TGAATATATG gaacaagccc 

TTCCAATGCT CTTCTTTGTT GGACCTAAAC 
ACCAGAAGTT ATATAAGCTG atgagcactg 
aggatagaga atgcaaagag ggaaagctga 
gtggtaatgt caccaaagag gaggccattg 
taagagagct gctggggtta gttttgcagg 
atttgttctg gaaattgatg tccattg1ga 
cttcaaatag gttgatgaac actgtaaaag 
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10 



IS 



20 



25 



30 



35 



40 



45 



50 



CCATCTTTGA ACAACCCATG GATCTGGATG CACTATTGAA CAAATGAACG AACAACAAAC 
GAAATAAGTT GTAGTGGGAA GCAGACGCCA CCTATTCACT GTCTTCTCTA CTGATCAGAT 
GTCTATTGCT TTCTCATTGA GGAAATGGCA AATAAAACAT TTTATGGAAA CATTTCATTG 
TTAAATTGAG ACAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 26Sd base pairs 

(B) TYPE: nucleic acid 

(C) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(iii) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GAAAGCGGTA GTTTTCGGCT CTTGGCTGTT TTTGTGTACT GCCTATTCTG TTCTTTATAT 60 

GCACTGAATG TGAAAATGCC TCACCAGAGT TTGATGCTGC TTCCGCAATT TCTTTGCATG 120 

TCCTAAACAT ACCAGTC ^^"7 

ATG TAT CTT TCC CGA CCT ACC GGC GTT GCC CGT TTT GCT GCC TCC 182 
Met Tyr Leu Ser Arg Pro Thr Gly Val Ala Axg Phe Ala Ala Ser 

TCC TCC TCC TCC TCC TCA GCT TCC CTA TTC CCT GGA GTG GAT GTG 227 
Ser Ser Ser Ser Ser Ser Ala Ser Leu Phe Pro Gly Val Asp Val 

GAC ACA ACA ACT AAA ACT GGA GCT TTG CAC TTT GAA GAA ACA AAA 
Asp Thr Thr Thr Lys Thr Gly Ala Leu His Phe Glu Glu Thr Lys 

GAA AGA ATT AAG AAA TTG TTC GAC AAG GTT GAA CTT TCA GTT TCT 317 
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Glu Arg lie Lys ty$ Leu Phe Asp Lys Val Glu Leu Ser Val Ser 

5 

GCA TAT GAT ACT GCA TGG GTG GCA ATG GTT CCT TCT CCA AAC TCT 
Ala Thy Asp Thr AZa Trp Val Ala Met Val Pro Ser Pro Asn Ser 

10 

CTC AAC CAA CCT CTT TTC CCC GAG TGT ATA AAC TGG GTA TTA GAT 407 
,5 Leu Asn Gin Pro Leu Phe Pro Glu Cys lie Asn Trp Val Leu Asp 

AGT CAA CAT GCT GAT GGC TCA TGG GGC CTA CTC CAC AAC GAT CAG 

20 

Ser Gin His Ala Asp Gly Ser Trp Gly Leu Leu His Asn Asp Gin 



25 TTG CTG ATG AAG GCC AAT CTC TTA TCT ACA TTA GCA TGT GTT CTT 497 

Leu Leu Met Lys Ala Asn Leu Leu Ser Thr Leu Ala Cys Val Leu 

30 

ACT CTT AAG CGG TGG AAT ATT GGG CAT GAT CAT ATG AGC AAG GCC 
Thr Leu Lys Arg Trp Asn He Gly His Asp His Met Ser Lys Ala 

35 

CTT GAT TTT ATC AAG TCT AAT ATA GCT TCA GCT ACT GAT GAG AAC 587 
Leu Asp Phe He Lvs Ser Asn lie Ala Ser Ala Thr Asp Glu Asn 

40 

CAA CGT TCT CCG GTG GGA TTT GAC ATT ATT TTC CCT GGC ATG ATT 
^ Gin Arg Ser Pro Val Gly Phe Asp He He Phe Pro Gly wet He 



50 GAG TAT GCT AAA GAC TTG AAT TTG AAT CTA CCC TTG GCA CCG ACG 

Glu Tyr Ala Lys Asp Leu Asn Leu Asn Leu Pro Leu Ala Pro Thr 



677 
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AAC GTG GAT GCC TTG GTT CGA AAG AAA GAG TTG GAG CTG AGA AGC 
Asn Val Asp Ala Leu Val Arg Lys Lys GIu Leu GIu Leu Arg Ser 

TGC AGA AGC AAC TCT GAA GGT GGA AAA GCC TAT TTA GCG TAT GTT 
Cys Arg Ser Asn Ser GIu Gly Gly Lys Ala Tyr Leu Ala Tyr Val 

TCA GAA GGA ATT GGA AAG TTA CAG GAC TGG GAT ATG GTC ATG CAA 
Ser Glu Gly lie Gly Lys Leu Gin Asp Trp Asp Met Val Met. Gin 

TAT CAA AGG AAG AAT GGA TCA CTG TTT AAT TCT CCA TCC ACT ACG 
Tyr Gin Arg Lys Asn Gly Ser Leu Phe Asn ser Pro Ser Thr Thr 



GCA GCG GCT TTT ATG CAT AGA AAT GAT GAT GGC TGT TTT GAT TAT 
Ala Ala Ala Phe Met His Arg Asn Asp Asp Gly Cys Phe Asp Tyr 



CTT CGC TCA CTG TTA CAA AAG TTT GAT GGC TCA GTT CCC ACA ATA 
Leu Arg Ser Leu Leu Gin Lys Phe Asp Gly Ser Val Pro Thr lie 

TAT CCT CTT GAT ATA TAT GCT CGA TTA CAC ATG GTT GAT AGC CTT 
Tyr Pro Levi Asp lie Tyr Ala Arg Leu His Met Val Asp Ser Leu 

CAA AAA TTC GGA ATT GCT CGG CAT TTC AAA GAG GAG ATT AGA AGC 
Gin Lys Phe Gly Xle Ala Arg His Pha Lys Glu Glu lie Arg Ser 

GTA TTA GAT GAA ACT TAC AGG TGT TGG ATG CAA GGA GAG GAA AAT 
Val Leu Asp Glu Thr Tyr Arg Cys Trp Met Gin Gly Glu Glu Asn 



767 
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947 
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AtA TTC TTA GAT GCT TCA ACT TGT GCA ATG GCC TTT CGA ATG TTA 
lie Phe Leu Asp Ala Ser Thr Cys Ala Met Aid P'ne Arg M«t: Leu 

CGT GTT GAA GGA TAT GAT GTT TCT TCA GAC CAG TTG ACT CAA TTT 
Arg Vai Glu Gly Tyr Asp Val Ser Ser Asp Gin Leu Thr Gin Phe 

TCA GAA GAT ATC TTT CCC AAT TGC CTT GGA GGA TAT TTA AAA GAC 
Ser Giu Asp lie Phe Pro Asn Cys Leu Gly Gly Tyr Leu Lys Asp 

TTC GGT GCC TCG CTG GAG TTA TAT AAG GCC TCT CAG ATT ATC ACG 
Phe Gly Ala Ser Leu Glu Leu Tyr Lys Ala Ser Gin lie lie Thr 

CAC CCC GAT GAA TCT GTT CTG GAA AAT ATA AAC TCT TGG ACT AGT 
His Pro Asp Glu Ser Val Leu Glu Asn lie Asn Ser Trp Thr Ser 

CGT TTC CTG AAG CAT GGA TTA TCT AGT GAT TCA GTT TGG TCT GAT 

Arg Phe Leu Lys His Gly Leu Ser Ser Asp Ser Val Trp Ser Asp 

AGA ACC GAT AGT GTT GTT AAA CAA GAG GCT GTT AAT GCT CTT GAG 
Arg Thr Asp $er Val Val Lys Gin Glu Ala Val Asn Ala Leu Glu 

TTC CCC TAT AAT GCA ACT CTA GAA CGC CTA ATA AGT AAG AGG GCA 
Phe Pro Tyr Asn Ala Thr Leu Glu Arg Leu He Ser Lys Arg Ala 

ATG GAA AGT TAC AGT GGA GAC ATT GTG AGO ATT TCA AAA TCG CCA 
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Met Glu Ser Tyr Ser Giy Asp lie Val Arg lie Ser Lys Ser Pro 

TAT GCC TGC TTA AAT TTT GGC CAT CAA GAT TTT CTG GAA CTT GCT 
Tyr Ala Cys Leu Asn Phe Gly His Gin Asp Phe Leu Glu Leu Ala 

GTA GAG GAT TTC AAT ACC CTG CAA CGC ATT CAT CTT AAA GAA CTG 
Val Glu Asp Phe Asn Thr Leu Gin Arg lie His Leu Lys Glu Leu 

GAA GAG CTT CAA AGA TGG GTG GTT GAA AAC AAA TTG GAC GAG TTG 
Glu Glu L«u Gin Arg Trp Val Val Glu Asn Lys Leu Asp Glu Leu 

AAA TTT TTC AGA CTG CAC CTA GGG TAC TGC TAT TTT GCT GCG GCA 
Lys Phe Phe Arg Leu His Leu Gly Tyr Cys Tyr Phe Ala Ala Ala 

GCG ACC CTT ACT GAT CCT GAA CTT CAT GAT GCT CGC ATA GCA TGG 
Ala Thr Leu Thr Asp Pro Glu Leu His Asp Ala Arg He Ala Trp 

GCA CAA AAT GGT GTG CTC ACG ACC GTG GTT GAT GAT TTC TAT GAT 
Ala Gin Asn Gly Val Leu Thr Thr Val val Asp Asp Phe Tyr Asp 

GGT GGA GGA TCT GAA GAG GAA TTG GAT AAC CTT ATA GAA TTG GTG 
Gly Gly Gly Ser Glu Glu Glu Leu Asp Asn Leu He Glu Leu Val 

GAA AAG TGG GAT CCT GAT GGG GAA GTG GGT TAC TGT TCC AAG GAC 

Glu Lys Trp Asp Pro Asp Gly Glu Val Gly Tyr Cys Ser Lys Asp 
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GTT GAG ATT GTA TTT CTT GCA CTG CAC AGC ACA GTT TGT GAA ATA 
Val Glu He val Phe Leu Ala Leu His Ser Thr Val Cys Glu lie 

GGA AGA AGA GCT TTA GTA TGG CAA GGA CGC AGT GTT ATC AGG AAT 1937 
Gly Arg Arg Ala Leu Val Trp Gin Gly Arg Ser Val Met Arg Asn 

GTT ATC GAT GGT TGG TTG GCT CTG CTG AAG GTG ATG AGA AAG GAA 
Val He Asp Gly Trp Leu Ala Leu Leu Lys Val Met Arg Lys Glu 



20 GCT GAA TGG TCG ACA AAT AAG GTA GTG CCA TCA ATG GGT GAA TAT 2027 

Ala Glu Trp Ser Thr Asn Lys Val Val Fro Ser Met Gly Glu Tyr 



ATG GAA CAA GCC CAT GTA TCA TTC GCG TTG GGA CCT ATA ATC CTT 
Met Glu Gin Ala His Val Ser Phe Ala Leu Gly Pro He He l«u 

CCA ATG CTC TTC TTT GTT GGA CCT AAA CTC TCA GAG GAA ATG ATT 2117 
Pro Met Leu Phe phe Val Gly Pro Lys Leu ser Giu Giu Met He 



GGA AGC TGT GAA TAC CAG AAG TTA TAT AAG CTG ATG AGC ACT GCT 
40 Gly Ser Cys Glu Tyr Gin Lys Leu Tyr Lys Leu Met Ser Thr Ala 



GGT CGC CTT AAG AAT GAT ATT CGA TCT TAC GAT AGA GAA TGC AAA 2207 
Gly Arg Leu Lys Asn Asp He Arg Ser Tyr Asp Arg Glu Cys Lys 



50 GAG GGA AAG CTG AAT ATT CTG TCT CTG TGG ATG ATT GAT GGC GGT 

Glu Gly Lys Leu Asn He Leu ser Leu Trp Met He Asp Gly Gly 
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GGT AAT GTC ACC AAA GAG GAG GCC ATT GAA GCA ATT AAA GGG GAT 2297 
Gly Asn Val Thr Lys Glu Glu Ala He Glu Ala He Lys Gly Asp 

TTT GAG AGG GCG ATA AGA GAG CTG CTG GGG TTA GTT TTG CAG GAG 
Phe Glu Arg Ala He Arg Glu Leu Leu Gly Leu Val Leu Gin Glu 

AAC ACT ACA ATT CCA AGA GCT TGT AAG GAT TTG TTC TGG AAA TTG 2387 
Asn Thr Thr He Pro Arg Ala Cys Lys Asp Leu Phe Trp Lys Leu 

ATG TCC ATT GTG AAT CTA TTT TAG ATG GAA GAT GAT GGG TAG ACT 
Met Ser He Val Asn Leu Phe Tyr Met Glu Asp Asp Gly Tyr Thr 

TCA AAT AGG TTG ATG AAC ACT GTA AAA GCC ATG TTT GAA CAA CCC 2477 
Ser Asn Arg Leu ^5et Asn Thr Val Lys Ala Met Phe Glu Gin Pro 

ATG GAT CTG GAT GCA CTA TTG AAC AAA TGA 2507 
Met Asp Leu Asp Ala Leu Leu Asn Lys ♦** 

ACGAACAACA AACGAAATAA GTTGTAGTGG GAAGCAGACG CCACCTATTC ACTGTCTTCT 2567 

CTACTGAXCA GATGTCTATT GCTTTCTCAT TGAGGAAATG GCAAATAAAA CATTTTATGG 2627 

AAACATTTCA TTGTTAAATT GAGACAGTCT G 2658 

Claims 

1. A polypeptide which is characterized by amino acid Nos. 1 -789 of the amino acid sequence as defined by SEQ ID 
N0:1 of the sequence listing. 

2. A polypeptide which has a catalytic activity of converting copalyl pyrophosphate to ent-kaurene wherein said 
polypeptide has the amino acid sequence as defined by amino acid Nos. 1-789 of SEQ ID NO:1 of the sequence 
listing in which one or more constituent amino acid residues are inserted, delated, and/or substituted. 

3. The polypeptide according to claim 2 which is a naturally-derived enzyme expressed in a plant body. 
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4. A polypeptide which contains a polypeptide according to any one of claims 1 to 3 in the molecule thereof and has 
a catalytic activity of converting copalyl pyrophosphate to ent-kaurene. 

5. A polypeptide which consists of or comprises as a part thereof a partial polypeptide being derived from any one of 
the polypeptides according to any one of claims 1 to 3 and being an active domain catalyzing the conversion of 
copalyl pyrophosphate to ent-kaurene. 

6. A DNA which encodes the polypeptide according to claim 1 . 

7. The DNA according to daim 6 which is characterized by the nucleotide Nos. 1 38-2507 of the nucleic acid sequence 
as defined by SEQ ID NO:3 of the sequence listing. 

8. A DNA which encodes a polypeptide having an enzymatic activity of converting copalyl pyrophosphate to ent-kau- 
rene and which has the DNA sequence according to claim 7 and contains Insertion, deletion, and/or substitution of 
one or more of the constituent nucleotides. 

9. An anti-sense DNA against an ent-kaurene B synthase gene comprising the DNA according to claim 6 wherein said 
anti-sense DNA has a DNA sequence complimentary to the DNA according to claim 7 and contains Insertion, dele- 
tion, and/or substitution of one or more of constituent nucleotkies. 

10. A recombinant vector which contains a DNA according to any one of claims 6 to 9. 

11. A transformant which contains a recombinant vector according to daim 10. 
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